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££y? T ™ L V ZABLE COMPOSITIONS COMPRISING A HEPATITIS C VIRUS NS3 PROTFacp 
DOMAIN/NS4A COMPLEX AND CRYSTALS THEREBY OBTAINED PROTEASE 

TECHNICAL FIELD OF INVENTION 

The present invention relates to compositions 
and crystals of a hepatitis C virus protease in complex 
with its viral cof actor. This invention also relates 
to methods of using the structure coordinates of 
hepatitis C virus protease in complex with a synthetic 
NS4A to solve the structure of similar or homologous 
proteins or protein complexes. 

15 BACKGROU ND OF THR INVENT TOM 

Infection by hepatitis C virus (HCV) is a 
compelling human medical problem. HCV is recognized as 
the causative agent for most cases of non-A, non-B 
20 hepatitis, with an estimated human seroprevalence of 1% 
globally [Choo, Q.-L. et al., "Isolation of a cDNA 
Clone Derived From a Blood-Borne Non-A, Non-B Viral 
Hepatitis Genome", Science , 244, pp. 359-362 (1989); 
Kuo, G. et al., "An Assay for Circulating Antibodies to 
25 a Major Etiologic Virus of Human Non-A, Non-B 

Hepatitis", Science , 244, pp. 362-364 (1989); Purcell, 
R.H., "Hepatitis C virus: Historical perspective and 
current concepts", FEM5 Microbiology Reviews , 14, pp. 
181-192 (1994); Van der Poel, C.L., "Hepatitis C Virus. 
Epidemiology, Transmission and Prevention in Hepatitis 
C virus. Current Studies in Hematology and Blood 
Transfusion, H.W. Reesink, Ed., (Basel: Karger) , pp. 
137-163 (1994)]. Four million individuals may be 
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infected in the United States alone (Alter, M.J. and 
Mast, E.E., "The Epidemiology of Viral Hepatitis in 
the United States, Gastroenterol. Clin. North Am. , 23, 
pp. 437-455 (1994) ] . 
5 Upon first exposure to HCV only about 20% of 

infected individuals develop acute clinical hepatitis 
while others appear to resolve the infection 
spontaneously. In most instances, however, the virus 
establishes a chronic infection that persists for 
10 decades [Iwarson, S. "The Natural Course of Chronic 

Hepatitis"/ FEM5 Microbiology Reviews , 14, pp. 2C1-204 
(1994)]. This usually results in recurrent and 
progressively worsening liver inflammation, which often 
leads to more severe disease states such as cirrhosis 
15 and hepatocellular carcinoma [Kew, M.C., "Hepatitis C 

and Hepatocellular Carcinoma", FEMS Microbiology 
Reviews , 14,- pp. 211-220 (1994); Saito, I., et al . 
"Hepatitis C Virus Infection is Associated with the 
Development cf Hepatocellular Carcinoma", Proc. Natl. 
20 Acad. Sci. USA 87, pp. 6547-6549 (1990)). Currently, 

there are no broadly effective treatments for the 
debilitating progression of chronic HCV. 

The HCV genome encodes a polyprotein of 3010- 
3033 amino acids (Figure 1) [Choo, Q.-L., et al . 
25 "Genetic Organization and Diversity of the Hepatitis C 

Virus", Proc. Natl. Acad. Sci. USA , 88, pp. 2451-2455 
(1991); Kato, N. et al., Molecular Cloning of the Human 
Hepatitis C Virus Genome From Japanese Patients with 
Non~A, Non-B Hepatitis", Proc. Natl. Acad. Sci. USA , 
30 87, pp. 9524-9528 (1990); Takamizawa, A. et al., 

"Structure and Organization of the Hepatitis C Virus 
Genome Isolated From Human Carriers", J. Virol. , 65, 
pp. 1105-1113 (1991)). The HCV nonstructural (NS) 
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proteins provide catalytic machinery for viral 
replication. The NS proteins are derived by 
proteolytic cleavage of the polyprotein 

[Bartenschlager, R . et al . , "Nonstructural Protein 3 of 
the Hepatitis C Virus Encodes a Serine-Type Proteinase 
Required for Cleavage at the NS3/4 and NS4/5 
Junctions", J. Virol. , 67, pp. 3835-3844 (1993); 
Grakoui, A. et al . "Characterisation of the Hepatitis C 
Virus-Encoded Serine Proteinase: Determination of 
Proteinase-Dependent Polyprotein Cleavage Sites", J. 
Viro1 " 67 ' PP- 2832-2843 (1993); Grakoui, A. et ' al7. 
Expression and Identification of Hepatitis C Virus 
Polyprotein Cleavage Products", J. Virol. , 67, pp 
1385-1395 (1993); Tomei, L. et al . , «NS3 is a'serine 
protease required for processing of hepatitis C virus 
polyprotein", J . Virol. , 67, pp. 4017-4026 (1993)]. 

The HCV NS protein 3 (NS3) contains a serine 
protease activity that helps process the majority of 
the viral enzym.es, and is thus considered essential for 
viral replication and infectivity. It is known that 
mutations m the yellow fever virus NS3 protease 
decreases viral infectivity [Chambers, T.J. et. al . , 
"Evidence that the N- terminal Domain of Nonstructural 
Protein NS3 From Yellow Fever Virus is a Serine 
Protease Responsible for Site-Specific Cleavages in the 
Viral Polyprotein", Pr oc. Natl. Acad. Sci . hsa . 87, PP 
8898-8902 (1990)]. The f irst 181 amino acids of NS3 
(residues 1027-1207 of the viral polyprotein) have been 
shown to contain the serine protease domain of NS3 that 
processes all four downstream sites of the HCV 
polyprotein (Figure 1) [C . Lin et al., "Hepatitis C 
Virus NS3 Serine Proteinase: Trans-Cleavage 



PCT7US97/16182 

WO 98/11134 

- 4 - 

Requirements and Processing Kinetics", J. Virol. , 68, 
pp. 8147-8157 (1994)]. 

NS3 is associated with a cofactor, NS4A. 
NS4A seems critical to the activity of NS3, enhancing 
5 the proteolytic efficiency of NS3 at all of the 
cleavage sites, NS4A is a 54 residue amphipathic 
peptide, with a hydrophobic N-terminus and a 
hydrophilic C-terminus [Failla, C. et al . , "Both NS3 
and NS4A are Required for Proteolytic Processing of 

10 Hepatitis C Virus Nonstructural Proteins", J. Virol. , 
68, pp. 3753-3760 (1994)]. Its function appears 
complex, possibly assisting in the membrane- 
localization of NS3 and ether viral replicase 
components [Lin, C. et al . "A Central Region in the 

15 Hepatitis C Virus NS4A Protein Allows Formation of an 
Active NS3-NS4A Serine Proteinase Complex In Vivo and 
In Vitro ", J. Virol, , 69, pp. 4373-4380 (1995b); 
Shimizu, Y. et al . , "Identification cf the Sequence on 
NS4A Required for Enhanced Cleavage of the NS5A/5B Site 

20 by Hepatitis C Virus NS3 Protease", J. Virol. , 70, pp. 

127-132 (1996); Tanji, Y. et al., "Hepatitis C Virus- 
Encoded Nonstructural Protein NS4A has Versatile 
Functions in Viral Protein Processing", J. Virol. , 69, 
pp. 1575-1581 (1995)] but its best characterized 

25 function is that of a cofactor for the NS3 protease. 

The current understanding of HCV has not led 
to satisfactory treatments for HCV infection. The 
prospects for effective anti-HCV vaccines remain 
uncertain. The only established therapy for HCV 

30 disease is interferon treatment. However, interferons 

have significant side effects [Janssen, H. L. A., et 
al. "Suicide Associated with Alf a-Interf eron Therapy 
for Chronic Viral Hepatitis", J. Hepatol. , 21, pp. 241- 
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243 (1994)]/ Renault, P.F. and Hoofnagle, J.H., "Side 
effects of alpha interferon. Seminars in Liver Disease 
9, 273-277. (1989)] and induce long term remission in 
only a fraction (~ 251) of cases [Weiland, 0. 
5 "Interferon Therapy in Chronic Hepatitis C Virus 
Infection", FEMS Microbiol. Rev. , 14, pp. 279-288 
(1994)]. Thus, there is a need for more effective 
anti-HCV therapies. 

The NS3 protease is considered a potential 
target for antiviral agents. However, drug discovery 
efforts directed towards the NS3 protein have been 
hampered by the lack of structural information about 
NS3 and its complex with NS4A. Such structural 
information would provide valuable information in 
15 discovery of HCV N33 protease inhibitors. However, 

efforts to determine the structure of HCV NS3 protease 
have been hampered by difficulties in obtaining 
sufficient quantities of pure active enzyme 
[Steinkuhler, C. et al . , "in Vitro Activity of 
Hepatitis C Virus Protease N53 Purified from 
Recombinant Baculovirus-Inf ected Sf9 Cells", j. B iol 
Chertu, pp. 637-6273 (1996)]. There have been~n^ 
crystals reported of any NS3 or NS3 protease domain 
protein. Thus, x-ray crystailographic analysis of such 
25 proteins has not been possible. 

SUMMARY OF THE INVENT I ON 
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Applicants have solved this problem by 
providing, for the first time, compositions comprising 
a hepatitis C virus (HCV) NS3 protease-like polypeptide 
complexed with a NS4A-l ike peptide and methods for 
making such compositions. 
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. The invention also provides crystals of a HCV 
NS3 protease-like polypeptide/NS4A-like peptide complex 
and methods for making such crystals. 

The invention also provides the structure 
coordinates of a HCV NS3 protease-like 
polypeptide/NS4A-like peptide complex. 

The invention also provides a method for 
determining at least a portion of the three-dimensional 
structure of molecules or molecular complexes which 
contain at least some structurally similar features to 
a HCV NS3 serine protease domain. 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 depicts HCV polyprotein processing. 
The locations of the HCV structural and nonstructural 
proteins are marked on a diagram of the 3011 amino acid 
polypeptide. Cleavages between the structural proteins 
by cellular signal peptidases are marked by asterisks. 
Cleavage between MS 2 and NS3 is mediated by the NS2/NS3 
metallo-protease . The NS3 serine protease is 
responsible for cleavages between NS3 and NS4A, NS4A 
and NS4B, NS4B and NS5A, and NS5A and NS5B . 

Figure 2 depicts stereo ribbon diagrams of 
the NS3/NS4A complex. The view is into the active site 
cleft of the enzyme. Side-chains of active site 
residues His-1083, Asp-1107, and Ser-1165, along with 
Zn ++ ligands Cys-1123, Cys-1125, and Cys-1171 are 
displayed in bal] -and-st ick representation. Zn ++ , its 
H20 ligand, and the p-strand formed by NS4A are also 
shown . 
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Figure 3 lists the atomic structure 
coordinates for hepatitis C virus recombinant, 
truncated nonstructural protein 3 (hereafter referred 
to as tNS3) in complex with a synthetic peptide of the 
central region of the nonstructural protein 4A 
(hereafter referred to as sNS4A) as derived by X-ray 
diffraction from crystals of that complex (hereafter 
referred to as tNS3/sNS4A) . The preparation of the 
complex is described in Examples 1 and 2 . The 
following abbreviations are used in Figure 3: 

"Atom type" refers to the element whose 
coordinates have been determined. Elements are defined 
by the first letter in the column except for zinc which 
is defined by the letters "Zn". 

"X, Y, Z" crystallographically define the 
atomic position determined for each atom. 

"B" is a thermal factor that measures 
movement of the atom around its atomic center. 

"Occ" is an occupancy factor that refers to 
the fraction of the molecules in which each atom " 
occupies the position specified by the coordinates A 
value of "1" indicates that each atom has the same 
conformation, i.e., the same position, in ail molecules 
of the crystal. 
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Figure 4 shows a diagram of a system used to 
carry out the instructions encoded by the storage 
medium of Figures 5 and 6. 

Figure 5 shows a cross section of a magnetic 
storage medium. 

Figure 6 shows a cross section of a 
optically-readable data storage medium. 
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DETAILED DESCRIPTION OF THE INVENTION 
The following abbreviations are used 
throughout the application: 



A 




Ala 




Alanine 


T 




Thr 




Threonine 


V 




Val 




Valine 


C 




Cys 




Cysteine 


L 




Leu 




Leucine 


Y 




Tyr 




Tyrosine 


I 




lie 




Isoleucine 


N 




Asn 




Asparagine 


P 




Pro 




Proline 


Q 




Gin 




Glutamine 


F 




Phe 




Phenylalanine 


D 




Asp 




Aspartic Acid 


W 




Trp 




Tryptophan 


E 




Glu 




Glutamic Acid 


M 




Met 




Methionine 


K 




Lys 




Lysine 


G 




Gly 




Glycine 


R 




Arg 




Arginine 


S 




Ser 




Serine 


H 




His 




Histidine 



HCV = hepatitis C virus 

Additional definitions are set forth in the 
5 specification where necessary. 

In order that the invention described herein 
may be more fully understood, the following detailed 
description is set forth. 

Applicants have solved the above problems by 
10 providing, for the first time, crystallizable 

compositions comprising a HCV NS3 protease-like 
polypeptide in complex with a NS4A-like peptide. 

Thus, in one embodiment of this invention is 
provided a composition comprising a hepatitis C virus 
15 NS3-like polypeptide in complex with an NS4A-like 

peptide . 

The HCV NS3-like polypeptide portion of the 
complex is any polypeptide which has the serine 
protease activity of the naturally occurring HCV NS3A 
20 protease, particularly the ability to cleave the HCV 



WO 98/11134 



PCT/US97/16182 



- 9 - 



10 



polyprotein. It includes HCV NS3, NS3 protease domain 
polypeptides and NS3 protease domain-like polypeptides. 

As used herein, the terms "HCV NS3" and "NS3" 
refers to the hepatitis C virus nonstructural-.! protein 
as defined in Lin, C. et al., "Hepatitis C Virus NS3 
Serine Proteinase: Trans-Cleavage Requirements and 
Processing Kinetics", J. Virol. , 68, pp. 8147-8157 
(1994) . 

The term "NS3 protease domain polypeptide" 
refers to a truncated, serine protease portion of NS3 
as defined in [Bartenschlager, R. et al., 
"Nonstructural Protein 3 of the Hepatitis C Virus 
Encodes a Serine-Type Proteinase Required for Cleavage 
at the NS3/4 and NS4/5 Junctions", J. Virol. , 67, pp. 
15 3835-3844 (1993); Grakoui, A. et al . "Characterisation 

of the Hepatitis c Virus-Encoded Serine Proteinase: 
Determination of Proteinase-Dependent Polyprotein 
Cleavage Sites", J. Virol. , 67, pp. 2832-2843 (1993); 
Grakoui, A. et al . , Expression and Identification of 
Hepatitis C Virus Polyprotein Cleavage Products", J. 
Viro1 -' 67 ' PP- 1385-1395 (1993); Tomei, L. et al.7" 
"NS3 is a serine protease required for processing of 
hepatitis C virus polyprotein", J. Virol. , 57, pp. 
4017-4026 (1993)]. The disclosure cf each of these 
25 documents is herein incorporated by reference. 

The term "NS3 protease domain-like 
polypeptides" refers to polypeptides that differ from 
NS3 protease domain polypeptides by having amino acid 
deletions, substitutions, and additions, but which 
retain the serine protease activity of NS3. 

Preferably, the NS3-like polypeptide in the 
compositions of this invention is tNS3, a recombinant^ 
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produced hepatitis C virus protease domain protein that 
is prepared as described herein. 

The NS4A-like peptide portion of the 
compositions of this invention is any peptide or 
5 peptide mimetic that is capable of acting as a NS4A 

cof actor for the NS3. These include NS4A, peptide 
fragments thereof and other peptides that differ from 
NS4A by having amino acid deletions, substitutions, and 
additions, while retaining the above-described 
10 activity. 

As used herein the term "NS4A" refers to the 
hepatitis C virus nonstructural protein 4A which acts 
as a cofactor for NS3 protease [Failla, C. et al., 
"Both NS3 and NS4A are Required for Proteolytic 
15 Processing of Hepatitis C Virus Nonstructural Proteins" 

J. Virol, 68, pp. 3753-3760 (1994); Lin, C. et al., 
^Hepatitis C Virus NS3 Serine Proteinase: Trans- 
Cleavage Requirements and Processing Kinetics" J . 
Virol . 68, pp. 8147-8157 (1994b)] 
20 Preferably, the NS4A-like peptide is sNS4A, 

the synthetic peptide H-KKGSWIVGRIVLSGKPAIIPKK-OH. 
This peptide encompasses the essential NS3 protease 
domain residues of NS4A. 

Both the NS3-like polypeptide and the NS4A- 
25 like peptide may be produced by any well-known method/ 

including synthetic methods, such as solid phase, 
liquid phase and combination solid phase/liquid phase 
syntheses; recombinant DNA methods, including cDNA 
cloning, optionally combined with site directed 
30 mutagenesis; and/or purification of the natural 

products, optionally combined with enzymatic cleavage 
methods to produce fragments of naturally occurring NS3 
and NS4A. 
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According to a preferred embodiment, the 
compositions of this invention are crystallizable . In 
this preferred embodiment all of the preferred choices 
for the NS3-like polypeptide and the NS4A-like peptide 
5 are identical to those indicated above. 

Advantageously, the crystallizable 
composition provided by this invention are amenable to 
x-ray crystallography. Thus, this invention also 
provides the three-dimensional structure of an HCV NS3- 

10 like polypeptide/NS4A-like peptide complex, 

specifically an HCV tNS3/sNS4A complex, at 2.5 A 
resolution. Importantly, this has provided for the 
first time, information about the shape and structure 
of the NS3 protease domain. 

15 The three-dimensional structure of the HCV 

tNS3/sNS4A complex of this invention is defined by a 
set of structure coordinates as set forth in Figure 3. 
The term "structure coordinates" refers to Cartesian 
coordinates derived from mathematical equations related 

20 to the patterns obtained on diffraction of a 

monochromatic beam of X-rays by the atoms (scattering 
centers) of an tNS3/sNS4A complex in crystal form. The 
diffraction data are used to calculate an electron 
density map of the repeating unit of the crystal. The 

25 electron density maps are then used to establish the 
positions of the individual atoms of the tNS3/sNS4A 
enzyme or enzyme complex. 

Those of skill in the art will understand 
that a set of structure coordinates for an enzyme or an 

30 enzyme-complex or a portion thereof, is a relative set 
of points that define a shape in three dimensions. 
Thus, it is possible that an entirely different set of 
coordinates could define a similar or identical shape. 
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Moreover, slight variations in the individual 
coordinates will have little effect on overall shape. 

The variations in coordinates discussed above 
may be generated because of mathematical manipulations 
5 of the structure coordinates. For example, the 

structure coordinates set forth in Figure 3 could be 
manipulated by crystallographic permutations of the 
structure coordinates, f ractionalization of the 
structure coordinates, integer additions or 

10 subtractions to sets of the structure coordinates, 

inversion of the structure coordinates or any 
combination of the above. 

Alternatively, modifications in the crystal 
structure due to mutations, additions, substitutions, 

15 and/or deletions of amino acids, or other changes in 

any of the components that make up the crystal could 
also account for variations in structure coordinates. 
If such variations are within an acceptable standard 
error as compared to the original coordinates, the 

20 resulting three-dimensional shape is considered to be 

the same. 

Various computational analyses are therefore 
necessary to determine whether a molecule or molecular 
complex or a portion thereof is sufficiently similar to 

25 all or parts of the NS3-like polypeptide/NS4A-like 

peptide structure described above as to be considered 
the same. Such analyses may be carried out in current 
software applications, such as the Molecular Similarity 
application of QUANTA (Molecular Simulations Inc., San 

30 Diego, CA) version 4.1, and as described in the 

accompanying User's Guide. 

The Molecular Similarity application permits 
comparisons between different structures, different 
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conformations of the same structure, and different 
parts of the same structure. The procedure used in 
Molecular Similarity to compare structures is divided 
into four steps: 1) load the structures to be 
5 compared; 2) define the atom equivalences in these 
structures; 3) perform a fitting operation; and 4) 
analyze the results. 

Each structure is identified by a name. One 
structure is identified as the target (i.e., the fixed 

10 structure) ; all remaining structures are working 
structures (i.e., moving structures). Since atom 
equivalency within QUANTA is defined by user input, for 
the purpose of this invention we will define equivalent 
atoms as protein backbone atoms (N, Cot, C and O) for 

15 all conserved residues between the two structures being 
compared. We will also consider only rigid fitting 
operations . 

When a rigid fitting method is used, the 
working structure is translated and rotated to obtain 

20 an optimum fit with the target structure. The fitting 
operation uses an algorithm that computes the optimum 
translation and rotation to be applied to the moving 
structure, such that the root mean square difference of 
the fit over the specified pairs of equivalent atom is 

25 an absolute minimum. This number, given in angstroms, 
is reported by QUANTA. 

For the purpose of this invention, any 
molecule or molecular complex that has a root mean 
square deviation of conserved residue backbone atoms 

30 (N, Ca, C, O) of less than 1.5 A when superimposed on 

the relevant backbone atoms described by structure 
coordinates listed in Figure 3 are considered 
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identical. More preferably, the root mean square 
deviation is less than 1.0 A, 

The term "root mean square deviation" means 
the square root of the arithmetic mean of the squares 
5 of the deviations from the mean. It is a way to 

express the deviation or variation from a trend or 
object. For purposes of this invention, the "root mean 
square deviation'' defines the variation in the backbone 
of a protein or protein complex from the relevant 
10 portion of the backbone of the NS3-like polypeptide 
portion of the complex as defined by the structure 
coordinates described herein. 

Once the structure coordinates of a protein 
crystal have been determined they are useful in solving 
15 the structures of other crystals. 

Thus, in accordance with the present 
invention, the structure coordinates of a NS3-like 
polypeptide/NS4A-like peptide complex, and in 
particular a tNS3/sNS4A complex, and portions thereof 
20 is stored in a machine-readable storage medium. Such 

data may be used for a variety of purposes, such as 
drug discovery and x-ray crystallographic analysis or 
protein crystal. 

Accordingly, in one embodiment of this 
25 invention is provided a machine-readable data storage 

medium comprising a data storage material encoded with 
the structure coordinates set forth in Figure 3. 

Figure 4 demonstrates one version of these 
embodiments. System 10 includes a computer 11 
30 comprising a central processing unit ("CPU") 20, a 

working memory 22 which may be, e.g, RAM (random-access 
memory) or "core" memory, mass storage memory 24 (such 
as one or more disk drives or CD-ROM drives), one or 
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more cathode-ray tube ( "CRT 11 ) display terminals 2 6, one 
or more keyboards 28, one or more input lines 30 f and 
one or more output lines 40, all of which are 
interconnected by a conventional bidirectional system 
5 bus 50. 

Input hardware 36, coupled to computer 11 by 
input lines 30, may be implemented in a variety of 
ways. Machine-readable data of this invention may be 
inputted via the use of a modem or modems 32 connected 

10 by a telephone line or dedicated data line 34. 

Alternatively or additionally, the input hardware 36 
may comprise CD-ROM drives or disk drives 24. In 
conjunction with display terminal 26, keyboard 28 may 
also be used as an input device. 

15 Output hardware 4 6, coupled to computer 11 by 

output lines 40, may similarly be implemented by 
conventional devices. By way of example, output 
hardware 4 6 may include CRT display terminal 2 6 for 
displaying a graphical representation of a binding 

20 pocket of this invention using a program such as QUANTA 
as described herein. Output hardware might also 
include a printer 42, so that hard copy output may be 
produced, or a disk drive 24, to store system output 
for later use. 

25 In operation, CPU 20 coordinates the use of 

the various input and output devices 36, 4 6, 
coordinates data accesses from mass storage 24 and 
accesses to and from working memory 22, and determines 
the sequence of data processing steps. A number of 

30 programs may be used to process the machine-readable 

data of this invention. Such programs are discussed in 
reference to the computational methods of drug 
discovery as described herein. Specific references to 
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components of the hardware system 10 are included as 
appropriate throughout the following description of the 
data storage medium. 

Figure 5 shows a cross section of a magnetic 
5 data storage medium 100 which can be encoded with a 
machine-readable data that can be carried out by a 
system such as system 10 of Figure 4. Medium 100 can 
be a conventional floppy diskette or hard disk, having 
a suitable substrate 101 , which may be conventional, 

10 and a suitable coating 102, which may be conventional, 
on one or both sides, containing magnetic domains (not 
visible) whose polarity or orientation can be altered 
magnetically. Medium 100 may also have an opening (not 
shown) for receiving the spindle of a disk drive or 

15 other data storage device 24. 

The magnetic domains of coating 102 of medium 
100 are polarized or oriented so as to encode in manner 
which may be conventional, machine readable data such 
as that: described herein, for execution by a system 

20 such as system 10 of Figure 4. 

Figure 6 shows a cross section of an 
optically-readable data storage medium 110 which also 
can be encoded with such a machine-readable data, or 
set of instructions, which can be carried out by a 

25 system such as system 10 of Figure 4. Medium 110 can 
be a conventional compact disk read only memory 
(CD-ROM) or a rewritable medium such as a 
magneto-optical disk which is optically readable and 
magneto-optically writable. Medium 100 preferably has 

30 a suitable substrate 111, which may be conventional, 

and a suitable coating 112, which may be conventional, 
usually of one side of substrate 111. 
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In the case of CD-ROM, as is well known, 
coating 112 is reflective and is impressed with a 
Plurality of pits 113 to encode the machine _ readable 
data. The arrangement of pits is read by reflecting 
laser light off the surface of coating ll 2 . A 
protective coating 114, which preferably i 5 
substantially transparent, is provided on top of 
coating 112. 

In the. case of a magneto-optical disk, as is 
well known, coating 112 has no pits 113, but has a 
plurality of magnetic domains whose polarity or 
orientation can be changed magnetically when heated 
above a certain temperature, as by a laser (not shown) . 
The orientation of the domains can be read by measuring 
the polarization of laser light reflected from coating 
112. The arrangement of the domains encodes the data 
as described above. 

For the first time, the present invention 
perm it5 the use of structure-based or rational drug 
desxgn techniques to design, select, and synthesize 
cnemicai entities, including inhibitory compounds that 
are capable of binding to HCV NS3, NS4A, NS3/NS4A 
complex, or any portion thereof. 

One particularly useful drug design technique 
enabled by this invention is iterative drug design 
Iterative drug design is a met hod for optimizing 
associations between a protein and a compound by 
determining and evaluating the three-dimensional 
structures of successive sets of protein/compound 
complexes . 

Those of skill in the art will realize that 
association of natural li gands or substrates wifch the 
binding pockets of their corresponding receptors or 
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enzymes is the basis of many biological mechanisms of 
action. The term "binding pocket", as used herein, 
refers to a region of a molecule or molecular complex, 
that, as a result of its shape, favorably associates 
5 with another chemical entity or compound. Similarly, 
many drugs exert their biological effects through 
association with the binding pockets of receptors and 
enzymes. Such associations may occur with all or any 
parts of the binding pockets. An understanding of such 

10 associations will help lead to the design of drugs 

having more favorable associations with their target 
receptor or enzyme, and thus, improved biological 
effects. Therefore, this information is valuable in 
designing potential ligands or inhibitors of receptors 

15 or enzymes, such as inhibitors of HCV NS3-like 

polypeptides, and more importantly HCV NS3. 

The term "associating with" refers to a 
condition of proximity between chemical entities or 
compounds, or portions thereof. The association may be 

20 non-covalent — wherein the juxtaposition is 

energetically favored by hydrogen bonding or van der 
Waals or electrostatic interactions or it may be 
covalent . 

In iterative drug design, crystals of a 
2 5 series of protein/compound complexes are obtained and 

then the three-dimensional structures of each complex 
is solved. Such an approach provides insight into the 
association between the proteins and compounds of each 
complex. This is accomplished by selecting compounds 
30 with inhibitory activity, obtaining crystals of this 

new protein/compound complex, solving the three- 
dimensional structure of the complex, and comparing the 
associations between the new protein/compound complex 
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and previously solved protein/compound complexes. By 
observing how changes in the compound affected the 
protein/compound associations/ these associations may 
be optimized, 

5 In some cases, iterative drug design is 

carried out by forming successive protein-compound 
complexes and then crystallizing each new complex. 
Alternatively/ a pre-formed protein crystal is soaked 
in the presence of an inhibitor, thereby forming a 

10 protein/compound complex and obviating the need to 

crystallize each individual protein/compound complex. 
Advantageously, the HCV NS3~like polypeptide/NS4A-like 
peptide crystals, and in particular the tNS3/sNS4A 
crystals, provided by this invention may be soaked in 

15 the presence of a compound or compounds, such as NS3 
protease inhibitors, to provide NS3-like 
polypeptide/NS4A-like peptide /compound crystal 
complexes . 

As used herein, the term ^soaked" refers to 
20 process in which the crystal is transferred to a 

solution containing the compound of interest. 

In another embodiment of this invention is 

provided a method for preparing a composition 

comprising a NS3-like polypeptide protein comprising 
25 the steps described in Examples 1 and 2. Preferably, 

the composition comprises a NS3-like polypeptide in 

complex with a NS4A-like peptide. 

The structure coordinates set forth in Figur 

3 can also be used to aid in obtaining structural 
30 information about another crystallized molecule or 

molecular complex. This may be achieved by any of a 

number of well-known techniques, including molecular 

replacement . 
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The structure coordinates set forth in Figure 
3 can also be used for determining at least a portion 
of the three-dimensional structure of molecules or 
molecular complexes which contain at least some 
5 structurally similar features to HCV MS 3 . In 

particular, structural information about another 
crystallized molecule or molecular complex may be 
obtained . This may be achieved by any of a number of 
well-known techniques, including molecular replacement. 
10 Therefore, in another embodiment this 

invention provides a method of utilizing molecular 
replacement to obtain structural information about a 
crystallized molecule or molecular complex whose 
structure is unknown comprising the steps of: 
15 a) generating an X-ray diffraction pattern 

from said crystallized molecule or molecular complex; 
and 

b) applying at least a portion of the 
structure coordinates set forth in Figure 3 to the 
20 X-ray diffraction pattern to generate a 

three-dimensional electron density map of the molecule 
or molecular complex whose structure is unknown. 

Preferably, the crystallized molecule or 
molecular complex comprises a NS3-like polypeptide and 
25 a NS4A-like peptide. More preferably, the crystallized 

molecule or molecular complex is obtained by soaking a 
crystal of this invention in a solution. 

By using molecular replacement, all or part 
of the structure coordinates of the tNS3/sNS4A complex 
30 provided by this invention (and set forth in Figure 3) 

can be used to determine the structure of a 
crystallized molecule or molecular complex whose 
structure is unknown more quickly and efficiently than 
attempting to determine such information ab initio . 



WO 98/1 1 134 PCT/US97/16182 



- 21 - 



Molecular replacement provides an accurate 
estimation of the phases for an unknown structure. 
Phases are a factor in equations used to solve crystal 
structures that can not be determined directly. 
Obtaining accurate values for the phases, by methods 
other than molecular replacement, is a time-consuming 
process that involves iterative cycles of 
approximations and refinements and greatly hinders the 
solution of crystal structures. However, when the 
crystal structure of a protein containing at least a 
homologous portion has been solved, the phases from the 
known structure provide a satisfactory estimate of the 
phases for the unknown structure. 

Thus, this method involves generating a 
preliminary model of a molecule or molecular complex 
whose structure coordinates are unknown, by orienting 
and positioning the relevant portion of the tNS3/sNS4A 
complex according to Figure 3 within the unit cell of 
the crystal of the unknown molecule or molecular 
complex so as best to account for the observed X-ray 
diffraction pattern of the crystal of the molecule or 
molecular complex whose structure is unknown. Phases 
can then be calculated from this model and combined 
with the observed X-ray diffraction pattern amplitudes 
to generate an electron density map of the structure 
whose coordinates are unknown. This, in turn, can be 
subjected to any well-known model building and 
structure refinement techniques to provide a final, 
accurate structure of the unknown crystallized molecule 
or molecular complex [E. Lattman, "Use of the Rotation 
and Translation Functions", in Meth. Enzymol ., 115, pp. 
55-77 (1985); M. G. Rossmann, ed., "The Molecular 
Replacement Method", Int. Sci . Rev. Ser. , No. 13, 
Gordon & Breach, New York (1972)]. 



WO 98/11134 



PCTYUS97/16182 



- 22 - 

The structure of any portion of any 
crystallized molecule or molecular complex that is 
sufficiently homologous to any portion of the 
tNS3/sNS4A complex can be solved by this method. 
5 In a preferred embodiment, the method of 

molecular replacement is utilized to obtain structura 
information about a molecule or molecular complex, 
wherein the complex comprises a NS3-like polypeptide. 
Preferably the NS3-like polypeptide is tNS3 or 

10 homologue thereof . 

The structure coordinates of tNS3/sNS4A as 
provided by this invention are particularly useful in 
solving the structure of other crystal forms of NS3- 
like polypeptide, preferably other crystal forms of 

15 tNS3; NS3-like polypeptide/NS4A-l i ke peptide, 

preferably tNS3/sNS4A; or complexes comprising any of 
the above. 

The structure coordinates are also 
particularly useful to solve the structure of crystal 

20 of NS3-like polypeptide/NS4A-like peptide complexes, 
particularly tNS3/sNS4A, co-complexed with a variety 
chemical entities . This approach enables the 
determination of the optimal sites for interaction 
between chemical entities, including interaction of 

25 candidate NS3 inhibitors with NS3 or the NS3/NS4A 

complex. For example, high resolution X-ray 
diffraction data collected from crystals exposed to 
different types of solvent allows the determination o 
where each type of solvent molecule resides. Small 

30 molecules that bind tightly to these sites can then b 

designed and synthesized and tested for their NS3 
inhibition activity. 
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All of the complexes referred to above may be 
studied using well-known X-ray diffraction techniques 
and may be refined versus 1.5-3 A resolution X-ray data 
to an R value of about 0.20 or less using computer 
5 software, such as X-PLOR [Yale University, ©1992, 

distributed by Molecular Simulations, Inc.; see, e.g., 
Blundell & Johnson, supra ; Meth. Enzymol ., vol. 114 & 
115, H. W. Wyckoff et al . , eds., Academic Press 
(1985} ] . This information may thus be used to optimize 

10 known NS3 inhibitors, and more importantly, to design 
new NS3 inhibitors. 

In order that this invention be more fully 
understood, the following examples are set forth. 
These examples are for the illustrative purposes only 

15 and are not to be construed as limiting the scope of 
this invention in any way. 



EXAMPLE 1 

Expression and Purification of tN£3 
20 The truncated NS3 serine protease domain 

(tNS3) was cloned from a cDNA of the hepatitis C virus 
H strain [Grakoui, A. et al., "Expression 3nd 
Identification of Hepatitis C Virus Polyprotein 
Cleavage Products", J. Virol. , 67, pp. 1385-1395 
25 (1993) ] . The first 181 amino acids of NS3 (residues 

1027-1207 of the viral polyprotein) have been shown to 
contain the serine protease domain of NS3 that 
processes all four downstream sites of the HCV 
polyprotein [Lin, C, et al., Hepatitis C Virus NS3 
30 Serine Proteinase: Trans-Cleavage Requirements and 

Processing Kinetics", J. Virol. 68, pp. 8147-8157 
(1994b)], so we expressed a (His) g-fusion protein based 
on this tNS3. The plasmid pET-BS ( + ) /HCV/T7-NS3i 81 -His 
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was derived from pTM3/HCV/ 1 027-1207 (NS3].8l) (Id.), by 
using polymerase chain reaction to introduce epitope 
tags and new restriction sites. A T7-tag (ASMTGGQQMG) , 
from the N-terminus of the gene 10 protein of the T7 
5 bacteriophage 

[Tsai, D.E. et al., M Jn Vitro Selection of an RNA 
Epitope Immunologically Cross-Reactive With a Peptide", 
Proc. Natl- Acad. Sci. USA , 89, pp. 8864-8868 (1992)], 
was placed at the N-terminus of the tNS3 domain. Two 
10 linker residues (GS) were placed at the tNS3 C- 

terminus, followed by the (His)g-tag. E.coli 
JM109(DE3) cells, freshly transformed with the pET- 
BS {+) /HCV/T7-NS3i81-His plasmid, were grown at 37 °C in 
complex media supplement with 100 pg/ml ampicillin, in 
15 a 10 L fermentor (Braun) . When the cell density 

reached an OD600 of 3-4 the temperature of the culture 
was rapidly reduced to 30 °C, and induction was 
immediately initiated by the addition of 1 mM IPTG. 
Cells were harvested at 2 h post-induction, and flash 
20 frozen at -70 °C prior to purification. 

The tNS3 was purified from the soluole 
fraction of the recombinant E.coli lysates as follows, 
with all procedures being performed at 4 °C unless 
stated otherwise. Cell paste (75-lQ0g) was resuspended 
25 in 15 volumes of 50 mM HEPES, 0.3 M NaCl , 10* glycerol, 

0.1% fi-octyl glucoside, 2 mM ft-mercaptoethanol , pH 8.0. 
Cells were ruptured using a microf luidizer and the 
homogenate was clarified by centrif ugation at 100,000 x 
g for 30 min. The supernatant was brought to 50 mM 
30 HEPES / 20 mM imidazole, 0.3 M NaCl, 27.5% glycerol, 

0.1% fl-octyl- glucoside, 2 mM fi-mercaptoethanol , pH 
8.0, and applied at 1.0 ml /min to a 7 . 0 ml Ni-Agarose 
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affinity column, equilibrated in the same buffer 
After loading, the column was washed with 10-15 volumes 
of equilibration buffer and the bound proteins were 
eluted with equilibration buffer containing 0 35 M 
imidazole. The protein was then si ze -fractionated on 
two columns in series (each 2.6 cm x 90 cm) pack ed with 
Pharmacxa high resolution S100 resin and equilibrated 
wath 25 mM HEPES, 0 . 3M NaCl, 10% glycerol, 0.1% fi- 
octylglucoside, 2 mM 3-mercaptoethanol , pH 8.0 The 
tNS3 fractions, identified by SDS-PAGE, were pooled and 
concentrated to 1 mg/ml using a Amicon Centriprep-10 
and stored at -70°C. The tNS3 was thawed slowly on ice 
and the NS4A peptide (dissolved in the size-exclusion " 
chromatography buffer) was added at a tNS3 : NS4A-peptide 
molar ratio of 1-9 

tio of 1.2. The sample was then diluted 2 5- 

fold with 15 mM MES, 0.5 M NaCl, 20 mM 15- 
mercaptoethanol, P H6.5, and concentrated to ~2 ml (~ o 
»g/»l) by ultrafiltration. The sample was then dUuted 
2-fold with the P H 6.5 buffer and concentrated again to 
-2 ml. This dilution process was repeated until it 
gave a >40-fold dilution of the original buffer 
constituents The protein sample was then concentrated 

13.0 mg/ml and centrifuged at -300,000 x g for 20 
-in at 4 °c. Concentrations of the pure tNS3 and 
tNS3/4A complex were determined by uv absorption 
spectroscopy, using a molar absorption coefficient 
(^280> of 17,700 M-l.cm-1. 



EXAMPLE 2 

4A Peptide Sy n thesis and Purifi^ ^, 

The HCV NS4A peptide was synthesized to span 
rescues Gly21 to Pro39 of the viral 
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1678 to 1696 of the HCV polyprotein) , which 
incorporates the essential region reported to be 
essential for NS3 stimulation [Lin, C. et al . "A 
Central Region in the Hepatitis C Virus NS4A Protein 
5 Allows Formation of an Active NS3-NS4A Serine 

Proteinase Complex In Vivo and In Vitro ", J. Virol. 69, 
pp. 4373-4380 (1995)]. Lysine residues were added to 
the termini to assist aqueous solubility, and a serine 
residue was substituted for Cys22 (residue 1679 of the 
10 polyprotein of the HCV H strain) . The peptide (H- 

KKGSWIVGRIVLSGKPAIIPKK-OH - TFA salt) was prepared by 
the solid-phase peptide synthesis (Applied Biosystems 
433A) beginning with N a -Fmoc, N e -Boc-Lys Wang resin. 
N a -Fmoc-protected amino acids were added sequentially 
15 using HBTU (2- (lH-benzotriazol-l-yl) 1/ 1/ 3, 3- 

tetramethyluronium hexaf luorophosphate) with HOBt (1- 
hydroxybenzotriazole hydrate) as coupling agents in N- 
methylpyrrolidinone. Cleavage from the resin and 
global deprotection were accomplished with 95% 
20 trifluoroacetic acid and 5% water at room temperature 

for 1.5 hr (15 ml/ g resin) . The peptide was purified 
by preparative HPLC on a Waters Delta Pak C18, 15 yra, 
30CA column (30 mm x 300 mm) eluting with a linear 
gradient of acetonitrile (15-40%) in 0.1% aqueous 
25 trifluoroacetic acid over 35 min (flow rate of 22 

ml/min) . Peptide purity was confirmed by analytical 
HPLC. The sequence was confirmed by direct N-terminal 
sequence analysis and matrix-assisted laser desorption 
mass spectrometry (Kratos MALDI I), which showed the 
30 correct (M + H) + and (M + Na) + molecular ions. 
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EXAMPLE 3 

Crystallizatio n and Data Collection 
Crystals of the tNS3/NS4A complex were grown 
by hanging-drop vapor diffusion over a reservoir of 0 1 
5 M MES, 1.8 M NaCI, 0.1 M sodium/potassium phosphate, 10 
mM B-mercaptoethanol, pH 6.5. The crystals grew over 
the course of 2-3 weeks, to final dimensions of about 
0.1 x 0.1 x 0.25 mm. The rhombohedral crystals used in 
this study belonged to space group R32, with unit cell 
dimensions a=b=225.0A, and c=75.5A, and contained two 
tNS3/NS4A complexes per asymmetric unit. 

Statistics for data collection, heavy atom 
refinement, and crystallographic refinement are given 
in Table 1. All heavy atom soaks were done in hanging- 
15 drops over the same reservoir as used for 

crystallization. Crystals were transferred to a 
stabilizing solution (50 mM MES , 2.0 M NaCI, 0.1 M 
sodium/potassium phosphate, 10 mM fi-mercaptoethanol , 
and 20% glycerol, pH 6.2) and then frozen in a dry 
nitrogen gas stream at 100 K (Molecular Structure 
Corp., Houston, TX) for data collection. Data was 
acquired by oscillation photography on a Rigaku R-AXIS 
I1C phosphor imaging area detector mounted on a Rigaku 
RU200 rotating anode generator (MSC) , operating at 50kV 
25 and 100mA. Measured intensities were integrated, 

scaled, and merged using the HKL software package {Z. 
Otwinowski and W. Minor) . 
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EXAMPLE 4 

Phasing, M odel Building and Refinement 
Heavy atom positions were located by 
inspection and confirmed with difference Fourier 
syntheses. Heavy atom parameters were refined and 
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phases computed to 3 . lA using the program PHASES 
[Furey, W. and Swaminathan, S . "PHASES-95 : a program 
package for the processing and analysis of diffraction 
data from macromolecules", Meth« Enzymol . , (1996) . MIR 
5 phases were improved and extended to 2 . 7A by cycles of 
solvent flattening [Wang, B.C., "Resolution of Phase 
Ambiguity in Macromolecular Crystallography", Methods 
in Enzymol . 115, pp. 90-112 (1985)] combined with 
histogram matching [Zhang, K.Y.J, and Main, P., "The 
10 Use of Sayre's Equation With Solvent Flattening and 

Histogram Matching for Phase Extension and Refinement 
of Protein Structures", Acta Crystallogr . , A46, pp. 
377-381 (1990)] using the CCP4 crys tallographic package 
(Collaborative Computation Project, 1994) . The 
15 resulting electron density map displayed nearly 

continuous density for the protein backbone as well as 
strong side chain density. Approximately 80% of the 
model could be unambiguously built into this map 
(QUANTA 4.1, Molecular Simulations), and a single round 
20 of simulated annealing refinement in X-PLOR [Brunger, 

A. T . , "X-PLOR : A System for X-Ray Crystallography and 
NMR", New Haven, Connecticut: Department of Molecular 
Biophysics and Biochemistry, Yale University (1993) ] 
brought the R-factor to 29% and free R value to 33% 
25 [Brunger, A. T . , "Free R Value: A Novel Statistical 

Quantity for Assessing the Accuracy of Crystal 
Structures", Nature , 355, pp. 472-4*75 (1992)]. The 
remainder of the model was built and refined in several 
steps, by first extending the resolution to 2 . 5A and 
30 then adding well-ordered water molecules. A final 

round of positional and individual temperature factor 
refinement brought the R-factor to 21.6% (free R value 
26.1%) for 26,652 reflections between 6.0 and 2 . 5A 
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(F>lsF). The current model consisted of tNS3 residues 
1055-1206 and NS4A residues 1678-1693 in complex A, and 
tNS3 residues 1028-1206 and NS4A residues 1678-1696 for 
complex B (polyprotein numbering, with 2 zinc atoms 
and 130 water molecules. A Ramachandran plot for the 
final model contained 91% of the residues in the most 
favored regions and 0% in disallowed or generously- 
allowed regions. The rms deviations from ideality were 
C007A for bond lengths and 1.47° for bond angles. 

While we have described a number of 
embodiments of this invention, it is apparent that our 
basic examples may be altered to provide other 
embodiments which utilize the products and processes of 
this invention. Therefore, it will be appreciated that 
the scope of this invention is to be defined by the 
appended claims rather than by the specific embodiments 
which have been represented by way of example. 
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CLAIMS 

We claim: 

1. A composition comprising a HCV NS3-like 
5 polypeptide complexed with a NS4A-like peptide. 

2. The composition according to claim 1, 
wherein the HCV NS3-like polypeptide is a NS3 protease 
domain polypeptide or a NS3 protease domain-like 

10 polypeptide . 

3 . The composition according to claim 1 , 
wherein the HCV NS3-like polypeptide is tNS3. 

15 4. The composition according to any one of 

claims 1 to 3, wherein the NS4A-like peptide is H- 
KKGSWTVGRIVLSGKPAI IPKK-OH . 

5. A crystal comprising a HCV NS3-like 
20 polypeptide complexed with a NS4A-like peptide. 

6. The crystal according to claim 5, wherein 
the HCV NS3-like polypeptide is a NS3 protease domain 
polypeptide or a NS3 protease domain-like polypeptide. 



7. The crystal according to claim 5, wherein 
the HCV NS3-like polypeptide is tNS3. 



30 



8. The crystal according to any one of 
claims i to 3, wherein the NS4A-like peptide is H- 
KKGSWIVGRI VLSGKPAI I PKK-OH . 
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9. The crystal according to claim 5, 
additionally comprising an inhibitor of HVC NS3. 

10. A machine-readable data storage medium, 

5 comprising a data storage material encoded with machine 
readable data, wherein the data is defined by the 
structure coordinates of a tNS3/sNS4A complex according 
to Figure 3, or a homologue of said complex, wherein 
said homologue comprises backbone atoms that have a 
10 root mean square deviation from the backbone atoms of 
the complex of not more than 1 . 5A 

11. The machine-readable data storage 
medium, according to claim 10, wherein said molecule or 
molecular complex is defined by the set of structure 
coordinates for tNS3/sNS4A according to Figure 3, or a 
homologue of said molecule or molecular complex, said 
homologue having a root mean square deviation from the 
backbone atoms of said amino acids of not more than 1 5 
20 A. 



12. A machine-readable data storage medium 
comprising a data storage material encoded with a first 
set of machine readable data comprising a Fourier 
transform of at least a portion of the structural 
coordinates for tNS3/sNS4A according to Figure 3; 
which, when combined with a second set of machine 
readable data comprising an X-ray diffraction pattern 
of a molecule or molecular complex of unknown 
structure, using a machine programmed with instructions 
for using said first set of data and said second set of 
data, can determine at least a portion of the structure 
coordinates corresponding to the second set of machine 
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readable data, said first set of data and said second 
set of data. 

13. A method of obtaining structural 

5 information about a molecule or a molecular complex of 

unknown structure by using the structure coordinates 
set forth in Figure 3, comprising the steps of: 

a. generating X-ray diffraction data from 
said crystallized molecule or molecular complex; 
10 b. applying at least a portion of the 

structure coordinates set forth in Figure 3 to said 
X-ray diffraction pattern to generate a 
three-dimensional electron density map of at least a 
portion of the molecule or molecular complex. 

15 

14. The method according to claim 13, 
wherein the molecule or molecular complex of unknown 
structure comprises a polypeptide selected from a NS3- 
like polypeptide in complex with a NS4A-like peptide. 
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# 


X 


Y 


Z 


OCC 
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ATOM 


1 


N 


VAL 


1055 


94.186 


42.391 


51.819 


1.00 


32.85 


ATOM 


2 


CA 


VAL 


1055 


92.743 


42.519 


51.462 


1.00 


30.82 


ATOM 


3 


CB 


VAL 


1055 


92.435 


41.883 


50.083 


1.00 


30.41 


ATOM 


4 


CG1 


VAL 


1055 


90.953 


41.986 


49.767 


1.00 


31.38 


ATOM 


5 


CG2 


VAL 


1055 


93.232 


42.576 


48.999 


1.00 


30.52 


ATOM 


6 


C 


VAL 


1055 


91.928 


41.817 


52.527 


1.00 


30.32 


ATOM 


7 


O 


VAL 


1055 


92.271 


40.715 


52.954 


1.00 


31.39 


ATOM 


8 


N 


GLU 


1056 


90.869 


42.481 


52.975 


1.00 


30.38 


ATOM 


9 


CA 


GLU 


1056 


89.985 


41.933 


53.994 


1.00 


28.82 


ATOM 


10 


CB 


GLU 


1056 


89.909 


42.877 


55.191 


1.00 


28.56 


ATOM 


11 


CG 


GLU 


1056 


91 .249 


43.090 


55.856 


1.00 


33.65 


ATOM 


12 


CD 


GLU 


1056 


91.183 


44.066 


57.007 


1.00 


37.88 


ATOM 


13 


OE1 


GLU 


1056 


91.610 


43.695 


58.120 


1.00 


40.68 


ATOM 


14 


OE2 


GLU 


1056 


90.713 


45.204 


56.796 


1.00 


40.04 


ATOM 


15 


C 


GLU 


1056 


88.590 


41.675 


53.442 


1.00 


26.54 


ATOM 


16 


O 


GLU 


1056 


88.102 


42.417 


52.591 


1.00 


26.63 


ATOM 


17 


N 


GLY 


1057 


87.970 


40.602 


53.914 


1.00 


24.45 


ATOM 


18 


CA 


GLY 


1057 


86.633 


40.261 


53.478 


1.00 


22.13 


ATOM 


19 


C 


GLY 


1057 


85.614 


40.570 


54.553 


1.00 


21.95 


ATOM 


20 


O 


GLY 


1057 


85.918 


40.510 


55.742 


1.00 


23.57 


ATOM 


21 


N 


GLU 


1058 


84.406 


40.924 


54.140 


1.00 


20.53 


ATOM 


22 


CA 


GLU 


1058 


83.352 


41.236 


55.084 


1.00 


18.51 


ATOM 


23 


CB 


GLU 


1058 


82.408 


42.295 


54.514 


1.00 


18.38 


ATOM 


24 


CG 


GLU 


1058 


83.060 


43.597 


54.096 


1.00 


18.99 


ATOM 


25 


CD 


GLU 


1058 


83.531 


43.593 


52.650 


1.00 


23.27 


ATOM 


26 


OE1 


GLU 


1058 


83.746 


42.504 


52.083 


1.00 


24.96 


ATOM 


27 


OE2 


GLU 


1058 


83.701 


44.686 


52.067 


1.00 


26.97 


ATOM 


28 


C 


GLU 


1058 


82.580 


39.954 


55.338 


1.00 


19.61 


ATOM 


29 


O 


GLU 


1058 


81.988 


39.770 


56.399 


1.00 


21.48 


ATOM 


30 


N 


VAL 


1059 


82.604 


39.062 


54.354 


1.00 


18.78 


ATOM 


31 


CA 


VAL 


1059 


81.899 


37.789 


54.428 


1.00 


18.22 


ATOM 


32 


CB 


VAL 


1059 


80.990 


37.583 


53.170 


1.00 


16.33 


ATOM 


33 


CG1 


VAL 


1059 


80.286 


36.249 


53.225 


1.00 


13.76 


ATOM 


34 


CG2 


VAL 


1059 


79.977 


38.702 


53.048 


1.00 


12.68 


ATOM 


35 


C 


VAL 


1059 


82.903 


36.644 


54.491 


1.00 


20.89 


ATOM 


36 


O 


VAL 


1059 


83.826 


36.571 


53.676 


1.00 


20.81 


ATOM 


37 


N 


GLN 


1060 


82.737 


35.758 


55.465 


1.00 


21.48 


ATOM 


38 


CA 


GLN 


1060 


83.629 


34.620 


55.582 


1.00 


22.32 
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39 


CB 


GLN 


1060 


84.405 


34.633 


56.908 


1.00 


23.40 


ATOM 


40 


CG 


GLN 


1060 


83.662 


35.176 


58.104 


1.00 


24.97 


ATOM 


41 


CD 


GLN 


1060 


83.707 


36.692 


58.201 


1.00 


25.79 


ATOM 


42 


OE1 


GLN 


1060 


84.519 


37.351 


57.553 


1.00 


27.09 


ATOM 


43 


NE2 


GLN 


1060 


82.842 


37.252 


59.031 


1.00 


25.24 


ATOM 
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C 


GLN 


1060 


82.863 


33.325 


55.410 


1.00 


23.40 
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45 


O 


GLN 


1060 


81.735 


33.202 


55.880 


1.00 


25.77 
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83.457 
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1.00 
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1061 
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31 .086 
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1.00 


23.08 
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1061 


83.399 


30.530 


53.052 


1.00 


23.13 
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49 


CG2 


ILE 


1061 


82.659 


29.258 


52.698 


1.00 


20.59 


ATOM 


50 


CG1 


ILE 


1061 


83.219 


31.584 


51.951 


1.00 


23.17 


ATOM 


51 


CD1 


ILE 


1061 


83.949 


31 .275 


50.692 


1.00 


21.52 


ATOM 


52 


C 


ILE 


1061 


83.335 


30.173 


55.517 


1.00 


24.42 


ATOM 


53 


o 


ILE 


1061 


84.498 


29.766 


55.574 


1.00 


26.06 


ATOM 


54 


N 


VAL 


1062 


82.404 


29.850 


56.404 


1.00 


25.24 


ATOM 


55 


CA 


VAL 


1062 


82.688 


29.028 


57.569 


1.00 


24.13 


ATOM 


56 


CB 


VAL 


1062 


82.085 


29.682 


58.832 


1.00 


21.93 


ATOM 


57 


CG1 


VAL 


1062 


82.568 


31.124 


58.951 


1.00 


20.61 


ATOM 


58 


CG2 


VAL 


1062 


80.578 


29.645 


58.778 


1.00 


18.03 


ATOM 


59 


c 


VAL 


1062 


82.201 


27.587 


57.443 


1.00 


26.46 


ATOM 


60 


o 


VAL 


1062 


81.391 


27.261 


56.573 


1.00 


26.50 


ATOM 


61 


N 


SER 


1063 


82.685 


26.730 


58.332 


1.00 


28.27 


ATOM 


62 


CA 


SER 


1063 


82.312 


25.326 


58.316 


1.00 


29.40 


ATOM 


63 


CB 


SER 


1063 


83.246 


24.539 


57.389 


1.00 


29.43 


ATOM 


64 


OG 


SER 


1063 


83.164 


25.013 


56.054 


1.00 


33.51 


ATOM 


65 


c 


SER 


1063 


82.339 


24.684 


59.696 


1.00 


31.16 


ATOM 


66 


o 


SER 


1063 


83.000 


25.167 


60.623 


1.00 


30.35 


ATOM 


67 


N 


THR 


1064 


81.593 


23.593 


59.811 


1.00 


33.89 


ATOM 


68 


CA 


THR 


1064 


81.497 


22.790 


61 .023 


1.00 


33.81 


ATOM 


69 


CB 


THR 


1064 


80.091 


22.855 


61.651 


1.00 


31.40 


ATOM 


70 


OG1 


THR 


1064 


79.099 


22.536 


60.664 


1.00 


33.92 


ATOM 


71 


CG2 


THR 


1064 


79.825 


24.230 


62.216 


1.00 


29.94 


ATOM 


72 


c 


THR 


1064 


81 .769 


21.383 


60.511 


1.00 


35.92 


ATOM 


73 


o 


THR 


1064 


81.951 


21.200 


59.305 


1.00 


36.71 


ATOM 


74 


N 


ALA 


1065 


81.800 


20.395 


61 .396 


1.00 


38.38 


ATOM 


75 


CA 


ALA 


1065 


82.060 


19.022 


60.975 


1.00 


40.41 


ATOM 


76 


CB 


ALA 


1065 


82.104 


18.114 


62.181 


1.00 


41.57 


ATOM 


77 


c 


ALA 


1065 


81.044 


18.490 


59.966 


1.00 


41.85 


ATOM 


78 


O 


ALA 


1065 


81.357 


17.623 


59.151 


1.00 


41.76 


ATOM 


79 


N 


THR 


1066 


79.827 


19.016 


60.025 


1.00 


43.16 


ATOM 


80 


CA 


THR 


1066 


78.758 


18.559 


59.148 


1.00 


45.82 


ATOM 


81 


CB 


THR 


1066 


77.582 


18.022 


60.003 


1.00 


48.55 


ATOM 


82 


OG1 


THR 


1066 


77.141 


19.047 


60.914 


1.00 


51.57 


ATOM 


83 


CG2 


THR 


1066 


78.009 


16.795 


60.791 


1.00 


51.00 


ATOM 


84 


C 


THR 


1066 


78.147 


19.613 


58.224 


1.00 


46.64 


ATOM 


85 


O 


THR 


1066 


77.374 


19.265 


57.323 


1.00 


48.12 


ATOM 


86 


N 


GLN 


1067 


78.466 


20.887 


58.431 


1.00 


45.58 


ATOM 


87 


CA 


GLN 


1067 


77.829 


21.923 


57.636 


1.00 


43.33 


ATOM 


88 


CB 


GLN 


1067 


76.635 


22.429 


58.438 


1.00 


44.54 


ATOM 


89 


CG 


GLN 


1067 


75.606 


23.244 


57.682 


1.00 


48.92 


ATOM 


90 


CD 


GLN 


1067 


74.377 


23.595 


58.535 


1.00 


51.71 


ATOM 


91 


OE1 


GLN 


1067 


73.519 


24.380 


58.113 


1.00 


53.17 


ATOM 


92 


NE2 


GLN 


1067 


74.297 


23.023 


59.738 


1.00 


49.37 


ATOM 


93 


C 


GLN 


1067 


78.720 


23.091 


57.269 


1.00 


40.47 


ATOM 


94 


O 


GLN 


1067 


79.567 


23.498 


58.053 


1.00 


41.29 


ATOM 


95 


N 


THR 


1068 


78.523 


23.612 


56.063 


1.00 


37.11 


ATOM 


96 


CA 


THR 


1068 


79.261 


24.767 


55.574 


1.00 


30.70 
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ATOM 


97 


CB 


THR 


1068 


80.113 


24.436 


54.331 


1.00 


29.76 


ATOM 


98 


OG1 


THR 


1068 


81 .008 


25.521 


54.062 


1.00 


30.04 


ATOM 


99 


CG2 


THR 


1068 


79.241 


24.184 


53.113 


1.00 


30.99 


ATOM 


100 


C 


THR 


1068 


78.236 


25.843 


55.234 


1.00 


28.68 


ATOM 


101 


O 


THR 


1068 


77.192 


25.565 


54.644 


1.00 


27.99 


ATOM 


102 


N 


PHE 


1069 


78.517 


27.063 


55.659 


1.00 


26.46 


ATOM 


103 


CA 


PHE 


1069 


77.631 


28.186 


55.420 


1.00 


23.03 


ATOM 


104 


CB 


PHE 


1069 


76.499 


28.199 


56.450 


1.00 


22.57 


ATOM 


105 


CG 


PHE 


1069 


76.955 


27.964 


57.854 


1.00 


22.39 


ATOM 


106 


CD1 


PHE 


1069 


77.01 1 


29.014 


58.759 


1.00 


22.57 


ATOM 


107 


CD2 


PHE 


1069 


77.351 


26.699 


58.271 


1.00 


22.33 


ATOM 


108 


CE1 


PHE 


1069 


77.458 


28.808 


60.057 


1.00 


23.82 


ATOM 


109 


CE2 


PHE 


1069 


77.800 


26.485 


59.566 


1.00 


22.79 


ATOM 


110 


CZ 


PHE 


1069 


77.855 


27.543 


60.461 


1.00 


23.30 


ATOM 


111 


C 


PHE 


1069 


78.456 


29.472 


55.447 


1.00 


22.55 


ATOM 


112 


O 


PHE 


1069 


79.670 


29.421 


55.249 


1.00 


21.97 


ATOM 


113 


N 


LEU 


1070 


77.832 


30.607 


55.734 


1.00 


19.95 


ATOM 


114 


CA 


LEU 


1070 


78.562 


31.866 


55.734 


1.00 


19.27 


ATOM 


115 


CB 


LEU 


1070 


78.091 


32.748 


54.571 


1.00 


19.20 


ATOM 


116 


CG 


LEU 


1070 


78.060 


32.230 


53.135 


1.00 


16.10 


ATOM 


117 


CD1 


LEU 


1070 


77.451 


33.307 


52.257 


1.00 


13.03 


ATOM 


118 


CD2 


LEU 


1070 


79.445 


31.859 


52.670 


1.00 


15.83 


ATOM 


119 


C 


LEU 


1070 


78.347 


32.649 


57.009 


1.00 


18.16 


ATOM 


120 


O 


LEU 


1070 


77.443 


32.345 


57.781 


1.00 


20.67 


ATOM 


121 


N 


ALA 


1071 


79.186 


33.661 


57.210 


1.00 


16.24 


ATOM 


122 


CA 


ALA 


1071 


79.091 


34.565 


58.349 


1.00 


17.10 


ATOM 


123 


CB 


ALA 


1071 


80.142 


34.249 


59.377 


1.00 


19.06 


ATOM 


124 


C 


ALA 


1071 


79.340 


35.935 


57.750 


1.00 


18.14 


ATOM 


125 


O 


ALA 


1071 


79.958 


36.030 


56.693 


1.00 


19.55 


ATOM 


126 


N 


THR 


1072 


78.839 


36.981 


58.396 


1.00 


17.51 


ATOM 


127 


CA 


THR 


1072 


78.999 


38.345 


57.909 


1.00 


16.67 


ATOM 


128 


CB 


THR 


1072 


77.679 


38.890 


57.335 


1.00 


14.90 


ATOM 


129 


OG1 


THR 


1072 


77.249 


38.048 


56.269 


1.00 


19.26 


ATOM 


130 


CG2 


THR 


1072 


77.852 


40.303 


56.799 


1.00 


13.79 


ATOM 


131 


C 


THR 


1072 


79.410 


39.262 


59.049 


1.00 


19.92 


ATOM 


132 


O 


THR 


1072 


78.785 


39.264 


60.112 


1.00 


22.53 


ATOM 


133 


N 


CYS 


1073 


80.470 


40.032 


58.840 


1.00 


20.00 


ATOM 


134 


CA 


CYS 


1073 


80.916 


40.960 


59.856 


1.00 


19.38 


ATOM 


135 


CB 


CYS 


1073 


82.388 


41.267 


59.691 


1.00 


17.15 


ATOM 


136 


SG 


CYS 


1073 


83.433 


39.961 


60.265 


1.00 


21.59 


ATOM 


137 


C 


CYS 


1073 


80.144 


42.240 


59.705 


1.00 


20.61 


ATOM 


138 


O 


CYS 


1073 


80.125 


42.821 


58.623 


1.00 


21.53 


ATOM 


139 


N 


ILE 


1074 


79.470 


42.654 


60.770 


1.00 


23.02 


ATOM 
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CA 


ILE 


1074 


78.706 


43.903 


60.793 


1.00 


24.61 
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CB 
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1074 


77.189 


43.676 


60.592 


1.00 


24.97 
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1074 


76.426 


44.967 


60.844 


1.00 


27.22 


ATOM 
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CG1 
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1074 


76.904 


43.174 


59.178 


1.00 


23.86 


ATOM 
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1074 


75.491 


42.685 


58.985 


1.00 


25.93 


ATOM 
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C 


ILE 


1074 


78.943 


44.491 


62.176 


1.00 


26.19 


ATOM 
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1074 


78.833 


43.790 


63.185 


1.00 


28.47 


ATOM 
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N 


ASN 


1075 


79.320 


45.760 


62.222 


1.00 


26.56 
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CA 
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79.593 


46.430 
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27.15 
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CB 
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1075 


78.297 


46.703 


64.261 


1.00 


28.70 


ATOM 
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CG 


ASN 


1075 


77.415 


47.756 


63.592 


1.00 


30.77 


ATOM 
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OD1 


ASN 


1075 


77.829 


48.418 


62.639 


1.00 


33.91 
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ASN 
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76.192 


47.915 


64.094 
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30.24 
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ASN 


1075 


80.599 


45.671 


64.368 


1.00 


28.13 


ATOM 
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ASN 


1075 


80.388 


45.504 


65.569 


1.00 


27.91 


ATOM 


155 


N 


GLY 


1076 


81.687 


45.206 


63.760 


1.00 


28.00 


ATOM 


156 


CA 


GLY 


1076 


82.723 


44.506 


64.502 


1.00 


27.37 


ATOM 
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C 


GLY 


1076 


82.41 1 


43.101 


64.970 


1.00 


29.01 


ATOM 


158 


O 


GLY 


1076 


83.264 


42.433 


65.567 


1.00 


28.27 


ATOM 
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N 


VAL 


1077 


81.205 


42.631 


64.678 


1.00 


29.05 


ATOM 


160 


CA 


VAL 


1077 


80.801 


41 .298 


65.084 


1.00 


27.37 


ATOM 


161 


CB 


VAL 


1077 


79.453 


41 .339 


65.821 


1.00 


26.61 


ATOM 


162 


CG1 


VAL 


1077 


79.062 


39.956 


66.292 


1.00 


27.45 


ATOM 


163 


CG2 


VAL 


1077 


79.536 


42.281 


66.995 


1.00 


27.61 


ATOM 


164 


C 


VAL 


1077 


80.688 


40.392 


63.864 


1.00 


27.94 


ATOM 


165 


o 


VAL 


1077 


80.387 


40.852 


62.759 


1.00 


29.66 


ATOM 


166 


N 


CYS 


1078 


80.989 


39.114 


64.062 


1.00 


26.17 


ATOM 


167 


CA 


CYS 


1078 


80.910 


38.114 


63.01 1 


1.00 


23.77 


ATOM 


168 


CB 


CYS 


1078 


82.097 


37.158 


63.101 


1.00 


23.73 


ATOM 


169 


SG 


CYS 


1078 


82.221 


35.981 


61.746 


1.00 


23.84 


ATOM 


170 
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CYS 


1078 


79.610 


37.378 


63.286 


1.00 


23.02 


ATOM 
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1078 


79.533 


36.557 


64.200 


1.00 


24.11 


ATOM 


172 
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TRP 


1079 


78.576 


37.726 


62.534 


1.00 


21.24 


ATOM 
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CA 


TRP 


1079 


77.260 


37.133 


62.698 


1.00 


19.55 


ATOM 


174 


CB 


TRP 


1079 


76.187 


38.178 


62.406 


1.00 


19.32 


ATOM 


175 


CG 


TRP 


1079 


76.340 


39.431 


63.178 


1.00 


21.82 


ATOM 


176 


CD2 


TRP 


1079 


75.766 


39.721 
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23.03 


ATOM 2505 


CA 


SER 


1689 


84.209 


45.213 


46.353 


1.00 


21.96 


ATOM 2506 


CB 


SER 


1689 


84.999 


46.366 


46.968 


1.00 


21.45 


ATOM 2507 


OG 


SER 


1689 


86.151 


45.888 


47.643 


1.00 


24.53 


ATOM 2508 


c 


SER 


1689 


85.031 


44.593 


45.238 


1.00 


22.42 


ATOM 2509 


O 


SER 


1689 


85.306 


45.252 


44.239 


1.00 


25.52 


ATOM 2510 


N 


GLY 


1690 


85.430 


43.337 


45.409 


1.00 


21.46 


ATOM 2511 


CA 


GLY 


1690 


86.242 


42.676 


44.406 


1.00 


21.23 


ATOM 2512 


c 


GLY 


1690 


87.690 


43.130 


44.463 


1.00 


23.65 


ATOM 2513 


O 


GLY 


1690 


88.451 


42.922 


43.521 


1.00 


25.98 


ATOM 2514 


N 


LYS 


1691 


88.082 


43.738 


45.577 


1.00 


23.43 


ATOM 2515 


CA 


LYS 


1691 


89.445 


44.224 


45.746 


1.00 


23.50 


ATOM 2516 


CB 


LYS 


1691 


89.561 


44.980 


47.067 


1.00 


21.39 


ATOM 2517 


CG 


LYS 


1691 


90.907 


45.621 


47.286 


1.00 


21.65 


ATOM 2518 


CD 


LYS 


1691 


90.888 


46.513 


48.507 


1.00 


22.73 


ATOM 2519 


CE 


LYS 


1691 


92.193 


47.269 


48.654 


1.00 


23.13 


ATOM 2520 


NZ 


LYS 


1691 


92.079 


48.318 


49.700 


1.00 


27.11 


ATOM 2521 


C 


LYS 


1691 


90.503 


43.116 


45.680 


1.00 


24.67 



WO 98/11134 












PCT/US97/16182 








FIGURE 3 (CONT.) 










ATOM P5PP 


V-/ 


LTO 


i oy i 


on AAr\ 


4<£.133 


46.424 
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26.46 
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PRO 
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1 ftQO 


y i M/o 


4o.*£49 
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25.07 
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CD 


PRO 


i oy*i 


fH IT i "7 

y i .oi / 


A A OA C 
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43.682 


1.00 


24.13 


ATOM 2525 


CA 


PRO 




no jT/i n 


AO 0~?C\ 


44.598 


1.00 


25.54 


ATOM 2526 


CB 


PRO 


i oy^ 


QO QQQ 

y^.yoo 


AO AE^CZ 


43.143 


1.00 


22.92 


ATOM 2527 


CG 


PRO 




qp Rm 




42.986 


1.00 


23.88 


ATOM 2528 


c 


PRO 


1692 


93.716 


AO 


45.516 


1.00 


27.60 


ATOM P5P9 


o 


PRO 


1692 


93.810 


4o./lD 


46.029 


1.00 


29.48 


ATOM 2530 


N 


ALA 


1693 


94.570 


41 

*f I .DZ.U 


45.770 


1.00 


31.04 


ATOM 2531 


CA 


ALA 


1693 


95.736 


41.833 


46.621 


1.00 


34.86 


ATOM 2532 


CB 


ALA 


1693 


96.112 


40.543 


47.326 


1.00 


35.21 


ATOM 2533 


C 


ALA 


1693 


96.915 


42.354 


45.798 


1.00 


35.37 


ATOM 2534 


O 


ALA 


1693 


96.891 


42.145 


44.559 


1.00 


35.58 



tNS3COORDINATES (ComplexB) 

Atom 





Type 


Resid 


# 


X 


Y 


Z 


OCC 


B 




ATOM 


1118 


N 


PRO 


1028 


74.820 


45.651 


43.675 


1.00 


28.69 


ATOM 


1119 


CD 


PRO 


1028 


75.076 


44.493 


44.550 


1.00 


28.33 


ATOM 


1120 


CA 


PRO 


1028 


74.869 


46.897 


44.441 


1.00 


28.80 


ATOM 


1121 


CB 


PRO 


1028 


74.807 


46.400 


45.883 


1.00 


28.68 


ATOM 1122 


CG 


PRO 


1028 


75.597 


45.138 


45.819 


1.00 


27.37 


ATOM 


1123 


C 


PRO 


1028 


76.163 


47.662 


44.173 


1.00 


28.69 


ATOM 


1124 


O 


PRO 


1028 


77.141 


47.094 


43.666 


1.00 


30.88 


ATOM 


1125 


N 


ILE 


1029 


76.172 


48.950 


44.500 


1.00 


25.65 


ATOM 


1126 


CA 


ILE 


1029 


77.359 


49.763 


44.288 


1.00 


22.21 


ATOM 


1127 


CB 


ILE 


1029 


77.029 


51.251 


44.319 


1.00 


19.66 


ATOM 


1128 


CG2 


ILE 


1029 


78.254 


52.059 


43.933 


1.00 


21.71 


ATOM 


1129 


CG1 


ILE 


1029 


75.883 


51.554 


43.355 


1.00 


19.62 


ATOM 


1130 


CD1 


ILE 


1029 


75.398 


52.960 


43.428 


1.00 


14.82 


ATOM 


1131 


C 


ILE 


1029 


78.383 


49.463 


45.371 


1.00 


23.25 


ATOM 


1132 


o 


ILE 


1029 


78.094 


49.586 


46.559 


1.00 


25.52 


ATOM 


1133 


N 


THR 


1030 


79.561 


49.023 


44.951 


1.00 


23.20 


ATOM 


1134 


CA 


THR 


1030 


80.651 


48.725 


45.860 


1.00 


23.22 


ATOM 


1135 


CB 


THR 


1030 


80.916 


47.207 


45.958 


1.00 


23.28 


ATOM 


1136 


OG1 


THR 


1030 


81.077 


46.659 


44.641 


1.00 


27.79 


ATOM 


1137 


CG2 


THR 


1030 


79.764 


46.508 


46.652 


1.00 


24.32 


ATOM 


1138 


C 


THR 


1030 


81.879 


49.406 


45.277 


1.00 


23.10 


ATOM 


1139 


O 


THR 


1030 


81 .982 


49.562 


44.059 


1.00 


24.23 


ATOM 


1140 


N 


ALA 


1031 


82.821 


49.779 


46.134 


1.00 


20.99 


ATOM 


1141 


CA 


ALA 


1031 


84.024 


50.456 


45.675 


1.00 


18.06 


ATOM 


1142 


CB 


ALA 


1031 


83.776 


51.946 


45.635 


1.00 


19.68 


ATOM 


1143 


C 


ALA 


1031 


85.205 


50.172 


46.579 


1.00 


17.02 


ATOM 


1144 


O 


ALA 


1031 


85.036 


49.858 


47.751 


1.00 


17.50 


ATOM 


1145 


N 


TYR 


1032 


86.404 


50.250 


46.028 


1.00 


16.80 


ATOM 


1146 


CA 


TYR 


1032 


87.597 


50.064 


46.837 


1.00 


18.55 


ATOM 


1147 


CB 


TYR 


1032 


88.127 


48.633 


46.786 


1.00 


18.83 


ATOM 


1148 


CG 


TYR 


1032 


88.710 


48.185 


45.466 


1.00 


15.88 


ATOM 


1149 


CD1 


TYR 


1032 


89.997 


48.545 


45.088 


1.00 


13.44 
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ALA 


1033 


90.567 
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46.944 
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20.61 
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1159 


CB 
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90.539 


53.545 
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18.51 
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92.142 
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46.215 
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94.236 
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21.69 
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94.432 
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27.16 
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1169 


C 
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1034 


95.111 


53.103 


45.955 


1.00 


22.49 


ATOM 


1170 


O 


GLN 


1034 


94.780 


54.005 


45.186 


1.00 


21.85 


ATOM 


1 171 


N 


GLN 


1035 


96.194 


53.156 


46.722 


1.00 


25.39 
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GLN 


1035 


97.151 


54.253 


46.648 


1.00 


26.92 


ATOM 

/» 1 N-Xi VI 
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GLN 


1035 


97.779 


54.498 


48.013 


1.00 


29.66 


ATOM 


1174 


CG 


GLN 


1035 


98.985 


55.414 


47.971 


1.00 


37.10 


ATOM 


1175 


CD 


GLN 


1035 


98.983 


56.413 


49.112 


1.00 


43.12 


ATOM 
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OE1 


GLN 


1035 


98.743 


56.056 


50.271 


1.00 


44.84 


ATOM 
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NE2 


GLN 


1035 


99.238 


57.678 


48.790 


1.00 


45.69 


ATOM 
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GLN 


1035 


98.234 


53.851 


45.655 


1.00 


27.03 


ATOM 


1179 


O 


GLN 


1035 


98.750 


52.741 


45.726 


1.00 


29.15 


ATOM 


1 180 


N 


THR 


1036 


98.582 


54.745 


44.738 


1.00 


26.03 


ATOM 


1181 


CA 


THR 


1036 


99.584 


54.437 


43.735 


1.00 


25.05 


ATOM 


1182 


CB 


THR 


1036 


99.094 


54.813 


42.363 


1.00 


24.90 


ATOM 


1183 


OG1 


THR 


1036 


98.718 


56.192 


42.375 


1.00 


25.74 


ATOM 


1184 


CG2 


—w- • 1 1— * 
THR 


1036 


97.901 


53.967 


41.988 


1.00 


23.85 


ATOM 


1185 


C 


THR 


1036 


100.911 


55. lol 


43.976 


1.00 


26.04 


ATOM 


1186 


o 


THR 


1036 


101.933 


54./ lid 


43.427 


1.00 


27.67 


ATOM 


1187 


N 


ARG 


1037 


100.895 




44.719 


1.00 


25.73 


ATOM 


1188 


CA 


ARG 


1037 


102.137 


56.928 


45.023 


1.00 


25.76 


ATOM 


1189 


CB 


ARG 


1037 


102.596 


57.805 


43.873 


1.00 


25.31 


ATOM 


1190 


CG 


ARG 


1037 


101.623 


P Ct O A A 

58.844 


43.446 


1.00 


27.06 


ATOM 


1191 


CD 


ARG 


1037 


102.326 


59.847 


42.575 


1.00 


27.29 


ATOM 


1192 


NE 


ARG 


1037 


103.303 


59.215 


41.696 


1.00 


25.59 


ATOM 


1193 


CZ 


ARG 


1037 


103.077 


58.933 


40.421 


1.00 


27.51 


ATOM 


1194 


NH1 


ARG 


1037 


101.893 


59.223 


39.877 


1.00 


27.27 


ATOM 


1195 


NH2 


ARG 


1037 


104.048 


58.402 


39.685 


1.00 


23.57 


ATOM 


1196 


C 


ARG 


1037 


102.088 


57.727 


46.303 


1.00 


26.18 


ATOM 


1197 


o 


ARG 


1037 


101.011 


58.021 


46.813 


1.00 


26.32 


ATOM 


1198 


N 


GLY 


1038 


103.266 


58.059 


46.823 


1.00 


26.76 
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1038 
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28.78 
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34.94 
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104.045 


63.977 


51.184 
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39.14 
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CD1 


LEU 


1039 


103.634 


63.744 


52.648 


1.00 


41.57 
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CD2 


LEU 


1039 


105.127 


65.038 


51 .079 


1.00 


41.24 
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LEU 


1039 


105.342 


62.782 


48.214 


1.00 


30.11 
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O 


LEU 


1039 


105.161 


63.647 


47.354 


1.00 


29.71 
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N 


LEU 


1040 


106.495 


62.136 


48.358 


1.00 


28.91 


ATOM 1211 


CA 


LEU 


1040 


107.676 


62.463 


47.568 


1.00 


27.58 


ATOM 1212 


CB 


LEU 


1040 


108.878 


61.685 


48.098 


1.00 


28.69 


ATOM 1213 


CG 


LEU 


1040 


110.213 


62.413 


47.959 


1.00 


30.89 


ATOM 1214 


CD1 


LEU 


1040 


110.195 


63.632 


48.870 


1.00 


32.42 


ATOM 1215 


CD2 


LEU 


1040 


111.358 


61.492 


48.325 


1.00 


30.06 


ATOM 1216 


C 


LEU 


1040 


107.524 


62.221 


46.061 


1.00 


26.74 


ATOM 1217 


O 


LEU 


1040 


107.899 


63.068 


45.242 


1.00 


28.09 


ATOM 1218 


N 


GLY 


1041 


106.994 


61.062 


45.695 


1.00 


24.50 


ATOM 1219 


CA 


GLY 


1041 


106.813 


60.750 


44.291 


1.00 


23.28 


ATOM 1220 


C 


GLY 


1041 


105.798 


61.679 


43.667 


1 00 


23 92 


ATOM 1221 


O 


GLY 


1041 


105.922 


62.054 


42.498 


1.00 


22.23 


ATOM 1222 


N 


CYS 


1042 


104.788 


62.050 


44.449 


1.00 


23.15 


ATOM 1223 


CA 


CYS 


1042 


103.751 


62.955 


43.981 


1.00 


23.53 


ATOM 1224 


CB 


CYS 


1042 


102.688 


63.138 


45.060 


1.00 


25.84 


ATOM 1225 


SG 


CYS 


1042 


101.422 


64.376 


44.680 


1.00 


27.73 


ATOM 1226 


C 


CYS 


1042 


104.398 


64.289 


43.627 


1.00 


23.74 


ATOM 1227 


O 


CYS 


1042 


104.253 


64.771 


42.507 


1.00 


25.62 


ATOM 1228 


N 


ILE 


1043 


105.183 


64.835 


44.555 


1.00 


23.35 


ATOM 1229 


CA 


ILE 


1043 


105.878 


66.110 


44.350 


1.00 


20.45 


ATOM 1230 


CB 


ILE 


1043 


106.720 


66.484 


45.593 


1.00 


20.32 


ATOM 1231 


CG2 


ILE 


1043 


107.437 


67.810 


45.373 


1.00 


20.94 


ATOM 1232 


CG1 


ILE 


1043 


105.815 


66.571 


46.831 


1.00 


18.68 


ATOM 1233 


CD1 


ILE 


1043 


106.557 


66.802 


48.145 


1.00 


18.48 


ATOM 1234 


C 


ILE 


1043 


106.765 


66.086 


43.103 


1.00 


20.27 


ATOM 1235 


O 


ILE 


1043 


106.645 


66.952 


42.242 


1.00 


23.53 


ATOM 1236 


N 


ILE 


1044 


107.622 


65.077 


42.981 


1,00 


20.30 


ATOM 1237 


CA 


ILE 


1044 


108.502 


64.965 


41.817 


1.00 


19.23 


ATOM 1238 


CB 


ILE 


1044 


109.370 


63.680 


41.878 


1.00 


18.89 


ATOM 1239 


CG2 


ILE 


1044 


110.205 


63.532 


40.616 


1.00 


15.12 


ATOM 1240 


CG1 


ILE 


1044 


110.276 


63.720 


43.108 


1.00 


17.13 


ATOM 1241 


CD1 


ILE 


1044 


110.979 


62.422 


43.383 


1.00 


19.28 


ATOM 1242 


C 


ILE 


1044 


107.673 


64.942 


40.541 


1 .00 


20.68 


ATOM 1243 


O 


ILE 


1044 


107.921 


65.721 


39.621 


1.00 


21.41 


ATOM 1244 
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THR 


1045 


106.647 


64.094 


40.513 


1.00 


23.02 


ATOM 1245 


CA 


THR 
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105.779 


63.968 


39.340 


1.00 


22.82 
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CB 


THR 


1045 


104.744 


62.848 


39.515 
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22.67 
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OG1 


THR 
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105.415 


61.631 


39.874 
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23.02 
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THR 


1045 


103.985 
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21.21 
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THR 
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105.044 


65.260 
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23.69 


ATOM 1250 


O 


THR 


1045 


104.773 


65.583 


37.903 


1.00 


26.48 


ATOM 1251 


N 


SER 


1046 


10/i 750 


66.013 


40.106 


1.00 


23.58 



WO 98/11134 



FIGURE 3'(CONT.) 



PCT/US97/16182 



ATOM 


1252 


CA 


SER 


1046 


104.043 


67.273 


39.960 


1.00 


24.58 


ATOM 


1253 


CB 


SER 


1046 


103.691 


67.852 


41.331 


1.00 


25.34 


ATOM 


1254 


OG 


SER 


1046 


102.926 


69.034 


41.195 


1.00 


26.40 


ATOM 

^A 1 V l 


1255 


c 


SER 


1046 


104.895 


68.264 


39.205 


1.00 


25.37 


ATOM 


1256 


o 


SER 


1046 


104.397 


69.000 


38.355 
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LEU 


1047 
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68.281 


39.521 


1.00 


25.73 


ATOM 
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CA 


LEU 
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1047 


107.094 


69.204 


38.873 


1.00 


26.65 
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LEU 


1047 
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69.450 
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26.72 
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LEU 
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